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This paper proposes new parametric model adequacy tests for possibly nonlinear and nonsta- 
tionary time series models with noncontinuous data distribution, which is often the case in applied 
work. In particular, we consider the correct specification of parametric conditional distributions 
I in dynamic discrete choice models, not only of some particular conditional characteristics such as 

moments or symmetry. Knowing the true distribution is important in many circumstances, in par- 
1^ . ticular to apply efficient maximum likelihood methods, obtain consistent estimates of partial effects 

! and appropriate predictions of the probability of future events. We propose a transformation of data 

which under the true conditional distribution leads to continuous uniform iid series. The uniformity 
and serial independence of the new series is then examined simultaneously. The transformation 
can be considered as an extension of the integral transform tool for noncontinuous data. We de- 
rive asymptotic properties of such tests taking into account the parameter estimation effect. Since 

■ transformed series are iid we do not require any mixing conditions and asymptotic results illustrate 
^ , the double simultaneous checking nature of our test. The test statistics converges under the null 

with a parametric rate to the asymptotic distribution, which is case dependent, hence we justify a 

■ parametric bootstrap approximation. The test has power against local alternatives and is consistent. 
, The performance of the new tests is compared with classical specification checks for discrete choice 

models. 

Keywords: Goodness of fit, diagnostic test, parametric conditional distribution, discrete choice 
models, parameter estimation effect, bootstrap. 
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1 Introduction 



Dynamic choice models are important econometric tools in applied macroeconomics and finance. 

These are used to describe the monetary policy decisions of central banks (Hamilton and Jorda, 
2002; Basu and de Jong, 2007), for recession forecasting (Kauppi and Saikkonen, 2008; Startz, 2008) 
and to model the behavior of agents in financial markets (Rydberg and Shephard, 2003). In the 
simplest framework, a binary dynamic model explains the value of an indicator variable in period t, 
Yf G {0, 1} , in terms of an information set available at this period. Then Yt conditional on Qt is 
distributed as a Bernoulli variable with expectation pt = E {Yt\flt) = P (Yt = l\^t) = F {t^i) where 
TTj = TT ) summarizes the relevant information and -F is a cumulative probability distribution 
function (cdf) monotone increasing. Typical specifications of the link function F are the standard 
normal cdf, and the logistic cdf. 

We can describe the observed values of It as Yj = 1 {Y^ > 0} where Y^ is given by the latent 
variable model 

Y* = TTt + £t 

and £t ^ F = are iid observations with zero mean. 

In a general specification ttj is a linear combination of a set of exogenous variables Xt observable 
in t, but not necessarily contemporaneous, plus lags of Yt and ttj itself, 

TTt = ao + a (L) TTt + 5 (L) Yt + X^/?, 

where 5 (L) = 5iL -\ h 5gL« and a (L) = aiL H h apL^. When g = 0, p = 1 and = $ this 

leads to the dynamic probit model of Dueker (1997), 

TTt = TTO + 6xYt-l + X'tli, 

and if the roots of 1 — a (L) are out of the unit circle, ttj can be represented in terms of infinite lags 
of Yt and Xt . 

Many nonlinear extensions have been considered in the literature, such as interactions with lags 
of Yt, to describe the state of the economy in the past, 

TTt = TTo + 5^Yt-i + X'tP + {Yt-iXt)' 7 

or with the sign of other variables in Xt, both stressing different reaction functions in several regimes 
defined in terms of exogenous variables at period t. Other specifications consider heteroskedasticity 
corrections, so that Var(e() = {Q,t) , for example a two regimes conditional variance, Var(£() = 
[Yt-,) . 

In the general ordered discrete choice model, the dependent variable takes J + 1 values in a 
set J , and the parametric distribution P(l^ = j\^t) can be modeled using the unobserved latent 
continuous dependent variable Y^*. In the typical case where Yt = j if /^j-i < Yt < Hj for j G J , 
J = {0, 1, 2, J} and £t ~ F^, with = — oo and jij = oo, we have that 

P{Yt = j\nt) = FeiHj - TTt) - F,{HJ_1 - TTt) 

with ao = 0. 

Forecasting is one of the main uses of discrete choice models. In that case for the calculation of 
predictions it might be necessary to resource to recursive methods when 6 (L) ^ 0. However in almost 
all situations parameters are unknown, but conditional maximum likelihood (ML) estimation is 
straightforward given the binomial or discrete nature of data, with typically well behaved likelihoods 
and asymptotic normal estimates if the model is properly specified. The existence, representation 
and probability properties of these models have been studied under general conditions by de Jong 
and Woutersen (2011), who also report the consistency and asymptotic normality of ML estimates 
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when the parametric model is correct. However, if not, estimates will be inconsistent and predictions 
can be severely biased. 

This leads to the need of diagnostic and goodness-of-fit techniques, which should account for the 
main features of these models, discrete nature and dynamic evolution. The first property entails 
nonlinear modeling and renders invalid many methods specifically tailored for continuous distribu- 
tions. Though the latent disturbance St is continuous and with a well specified distribution, it is 
unobservable. Simulation methods could be used to estimate the distribution of such innovations, 
but we follow an alternative route by continuing" the discrete observations Yt, so that they have 
a continuous and strictly increasing conditional distribution in [— 1, J] given This distribution 
inherits the dependence on a set of parameters and on a conditional information set and can serve 
as a main tool to evaluate the appropriateness of the hypothesized model. 

Conditional distribution specification tests are often based on comparing parametric and non- 
parametric estimation as Andrews' (1997) conditional Kolmogorov test, or on the integral transform 
(see Bai 2003, Corradi and Swanson 2006). The former approach is developed for different data 
types, while the latter can be used only for data with continuous distribution. The integral trans- 
form does not require strong conditions on the data dependence structure, so it is very useful in 
testing dynamic models. However, applying the integral transform to noncontimious data will not 
bring to uniform on [0, 1] series, and therefore this approach can not be applied directly to dynamic 
discrete choice models. To guarantee that adequacy tests based on the integral transform enjoy nice 
asymptotic properties we propose the following procedure: first, make data continuous by adding 
a continuous random noise and then apply the modified conditional distribution transformation to 
get uniform iid series. 

The first step can be called the continuous extension of a discrete variable which has been em- 
ployed in different situations. For example Ferguson (1967) uses some type of extension for simple 
hypothesis testing, Denuit and Lambert (2005) and Neslezhova (2006) use it to apply a copulas 
technique for discrete and discontinuous variables. The second step is the probability integral trans- 
form (PIT) of the continued variables, which we will call randomized PIT. Resulting uniform iid 
series can be tested using Bai (2003) or Corradi and Swanson (2006) tests. However, in some cases 
these tests can not distinguish certain alternatives, so we also propose test based on comparing joint 
empirical distribution functions with the product of its theoretical uniform marginals by means of 
Cramer-von Mises or Kolmogorov-Smirnov type statistics, developed by Kheifets (2011) for contin- 
uous distributions. 

In a general setup, we do not know the true parameters, while the integral transform using 
estimated parameters does not necessary provide iid uniform random variates. Hence asymptotic 
properties and critical values of the tests with estimated parameters have to be addressed. The 
estimation effect changes the asymptotic distribution of the statistics and makes it data dependent. 
Andrews (1997) proves that parametric bootstrap provides correct critical values in this case using 
linear expansion of the estimation effect, which arises naturally under the ML method. The idea of 
orthogonal projecting the test statistics against the estimation effect due to Wooldridge (1990) has 
been used in parametric moment tests, see Bontemps and Meddahi (2006). The continuous version 
of the projection, often called Khmaladze (1981) transformation, was employed in the tests of Koul 
and Stute (1999) to specify the conditional mean, and of Bai and Ng (2001), Bai (2003), Dclgado and 
Stute (2008) to specify the conditional distribution. These projection tests are not model invariant 
since they require to compute conditional mean and variance derivatives, and also projections may 
cause a loss in power. In this paper we apply a bootstrap approach instead. In the case of ordered 
choice models an extensive Monte Carlo comparison of specification tests has been done by Mora 
and Moro (2008) in a static cross section context. They study two types of tests based on moment 
conditions and on comparison of parametric and nonpar ametric estimates. 

Despite that there is some work on nonstationary discrete data models, cf. Phillips and Park 
(2000), we stress stationary situations, but some ideas could be extended to a more general set up as 
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far as the conditional model provides a full specification of the distribution of the dependent discrete 
variable. 

The contributions of this paper are following: 1) a new specification test for dynamic discon- 
tinuous models is proposed, 2) we show that the test is invariant to the choice of distribution of 
the random noise added, 3) parameter estimation effect of the test is studied, 4) under standard 
conditions we show the asymptotic properties of such tests, and 5) since asymptotic distribution 
is case dependent, critical values can not be tabulated and we prove that a bootstrap distribution 
approximation is valid. 

The rest of the paper is organized as follows. Section 2 introduces specification test statistics. 
Asymptotic properties and bootstrap justification provided in Section 3. Monte Carlo experiments 
are reported in Section 4. Section 5 concludes. 

2 Test statistics 

In this section we introduce our goodness-of-fit statistics. Suppose that a sequence of observations 
{Yi,Xi), (12,-^^2), {Yt,Xt) is given. Let fit = {Xt,Xt-i, . . . ;Ft_i,lt_2, . . .} be the information 
set at time t (not including Yj). We consider a family of conditional cdf's F{y\Qt,d), parameterized 
hy 9 £ Q, where G C is a finite dimensional parameter space. We could allow for nonstationarity 
by permitting the change in the functional form of the cdf of It using subscript t in Ff. Our null 
hypothesis of correct specification is 

Hq : The conditional distribution of Yt conditional on fit is in the parametric family F{y\il.t,d) 
for some Oq £ @. 

For example, for dynamic ordered discrete choice model the null hypothesis would mean that 
36*0 G 0, Vj = 0, . . . , J, P {Yt = j\ fit) = Pj(fit, 6*0)1 i-e. that all conditional probabilities are in a 
given parametric family. 

For further analysis, we assume that the support of the conditional distributions -F(y|fit,^) is 
a finite set of nonnegative integers {0, . . . , J} and F(y|fit, 9) = YljKy^FUl^t, S), where P_f is the 
probability function at the discrete points. 

The first step is to obtain a continuous version of Y. For any random variable Z ^ F^ with 
support in [0, 1] and Fz continuous (but not necessary strictly increasing) define the continued by Z 
version of Y, 

Y'^ = Y + Z -I. 
Then the distribution of the continued version of Y is 

Ft (y|fit) = P (yt < = F ([y]|fit) + F,(y - [y]) P ([y] + l|fit) , (1) 

which is strictly increasing on [— 1, J] . The typical choice for Z is the uniform in [0, 1] , so that 

Ft (2/lfit) = F([y]|fit) + {y- [y]) P([y] + l|fit). (2) 

The binary choice case renders Ft (y|fit) = {y—[y]) (1 — Pt) for y G [—1,0) and Ft (y|fit) = (1 — pt) + 
{y ~ [y])Pt for y G [0,1] . Note, that Ft coincides with F in the domain of F. We state next an 
"invariance property": for our purpose, it does not matter how to continue Y and what distribution 
Fz of the noise Z to add. The unit support of Z is needed to get a simple expression for Ft in ([T]), 
otherwise the resulting distribution will be a convolution Ft (y|fif) = X^/=o {y + ^ — j) P (j|fit)- 
Continuation idea has been used to deal with discrete distributions, for example, to work with 
copulas with discrete marginals as in Denuit and Lambert (2005). 

The following proposition generalizes results about the probability integral transform. 
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Proposition 1 (a) Under Hq random variables Ut = F'^ {Y^\Q.t,6o) are iid uniform; (h) Invariant 
property of randomized PIT: realizations of Ut are the same for any distribution in (Ql) both under 
Hq and under the alternative. 

Part (a) is a property of usual PIT with a continuous distribution F"^. Part (b) that reahzations of 
Ut are the same, means the following. Consider continuations of Yt by arbitrary Z F^ and uniform 
Zu ~ Fjj. Fix realizations {yt}, {zt} and {z^t} from respective distributions. If z^t = Fz{zt), then 

F^""^ {yt + zt- l\^t, 9o) = F^^^iyt + zut - l\^t, ^o), 

where F'^^'-' stresses dependence of on Fz in ([T]), i^t^t/ ^s F'^ in ([2]), continued by uniform, 
and Qt denotes here realized past. Therefore, although a continued variable and its distribution 
F"^ depends on Fz, F"^ {Y^\Q,t,9Q) is not and we can always use uniform variables Z for continuation 
without affecting any properties of tests based on Ut- 

Now we can use the fact that under the null hypothesis Ut = F'^ {Y^\Q,t,9o), t = 1, . . . ,T, are 
uniform on [0,1] and iid random variables, so that P{Ut-i < ri, Ut-2 ^ f2,---, Ut-p < rp) = rir2 ■ ■ ■ r., 
for r = (ri, . . . , rp) G [0, 1]^. This motivates us to consider the following empirical processes 



p, 



VpTir) 



1 ^ 

y 

VT - iP + 1) ti;ti 



Y[ ^i^t-j < rj) - rir2 ...rp 



If we do not know either {{Yt,Xt),t < 0}, we approximate Ut with JJt = Fl{Y^\Q,t,9) where 9 
is an estimator of 9q and the truncated information set is (it = {Xt,Xt-i, . . . , Xi;Yt-i,Yt-2, ■ ■ ■ , Yi} 
and write 



VpTir) 



1 



rir2 ...rr. 



(3) 



and 



DpT = T{VpT{r)) 

for any continuous functional r(-) from ^°^([0, 1]*'), the set of uniformly bounded real functions on 
[0, 1]^, to R. In particular we use the Cramer-von Misses and Kolmogorov Smirnov test statistics 



CvM 
pT 



VpT{r)'^dr or Dpr^ = max 



VpTir) 



'[0,1]P ' ' [0.1]'' 

One further possibility is to test for j-lag pairwise independence, using the process 

T 



(4) 



1 



[liUt < ri)IiUt-j < rs) - rira" 



(5) 



t=j+i 

and corresponding test statistics -D^j^ and -D^^j, say. 

We can aggregate across p or j summing possibly with different weights obtaining generalized 
statistics 

T-l T-l 

ADPt = Hp)DpT, or ADJt = Hj)D2T,j. (6) 
p=i j=i 

For p = 1, -Dfj? delivers a generalization of Kolmogorov test to discrete distributions. Usually 
this test captures general deviations of marginal distribution but lacks power if only dynamics is 
misspecified. In particular, it does not have power against alternatives where Ut are uniform on 
[0,1] but not independent. For general p, VpT delivers a generalization of Kheifets (2011) to discrete 
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distributions. This test should capture both deviations of marginal distribution and deviations in 
dynamics. 

A more direct approach is based in Box-Pierce (1970) type of statistics, we could consider 

m 

BPUm:=Tj2PT,u{j)\ 

j=l 

m = 1,2, . . . , and pT,u (j) are the sample correlation coefficients of the Uls at lag j. Noting that Ut 
should be uniform continuous iid random variables under the null of correctly specified model, but 
might be correlated under alternative hypothesis of wrong specification, BPUm is a good basis to 
design goodness-of-fit tests. This idea is related to the tests of Hong (1998). Alternatively, we can 
check autocorrelations of Gaussian residuals *^{Ut) 

m 

BPNm -tJ^Ptmu) Of > 

and normality of $([/*) with Jarque-Bera test. In addition we can check autocorrelations of discrete 
innovations, 

^ ^ Yt-E[Yt\nt] 
which are just the usual standardized probit residuals. We can define 

m 

BPDm ■.= Tj2PT,e{jf 

and other statistics based on autocorrelations of squares of different types of residuals. The asymp- 
totic distribution of these statistics can be approximated by chi square distributions when the true 
parameters 9o are known. Unlike tests based on empirical process, these tests can not capture some 
alternatives, for example if misspecification involves only higher order moments. 

Parameter estimation affects the asymptotic distribution of these statistics, as well as that of 
those tests based on the empirical distribution of the Uj-S. There are different bootstrap and sampling 
techniques to approximate asymptotic distribution, see for example Shao and Dongsheng (1995), 
Politis, Romano and Wolf (1999). Since under Hq we know the parametric conditional distribution, 
we apply parametric bootstrap to mimic the Hq distribution. We introduce the algorithm now for 
statistics r(V2r)- 

1. Estimate model with initial data {Yt,Xt), t = 1,2,...,T, get parameter estimator 6, get test 
statistic T{V2t)- 

2. Simulate Y* with F{-\^*,9) recursively for t = 1,2, T, where the bootstrap information set 

is o: = (Xi,Xt_i,...,y;_i,y;_2,...). 

3. Estimate model with simulated data Y^* , get 9* , get bootstrapped statistics r(y2T)- 

4. Repeat 2-3 B times, compute the percentiles of the empirical distribution of the B boostrapped 
statistics. 

5. Reject Hq if T{V2t) is greater than the corresponding (1 — a)th percentile. 

We will prove that T{V2rp) has the same limiting distribution as T(V2t)- Bootstrapping other 
statistics is similar. 
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3 Asymptotic properties of specification tests 



In this section we derive asymptotic properties of the statistics based on V2t- We start with the 
simple case when we know parameters, then study how the asymptotic distribution changes if we 
estimate parameters. We provide analyses under the null, under the local and fixed alternatives. 
We first state all necessarily assumptions and propositions, then discuss them. 

Let II • II denote Euclidean norm for matrices, i.e. \\A\\ = y^ti{A'A) and for s > 0, B{a,e) is an 
open ball in with the center in the point a and the radius e. In particular, for some M > 
denote Bt = B [Oq.MT-^/'^) = {6* : | |6l - 6lo| | < MT-^/"^}. For any discrete distributions G and F, 
with probability functions Pg and Pi?, and r G [0, 1] define 

d{G,F,r) = G {F-^ (r)) - F {F-^ (r)) 
We have d (F, F, r) = 0, but d (G, F, r) is not symmetric in G and F. 

Assumption 1 Uniform boundedness away from zero: Ve > 0, 35 > 0, such that |-F(0|r2t, 0)| > e 
and \F{j\nt,e) - F{j - l\nt,9)\ > e for J = 1, . . . , J uniformly in € B{eo,5). 
Assumption 2 Smoothness with respect to parameters: 

(2.1) 

E max sup m&x\F(y\nt,u)-F(y\nt,eo)\=o(T-'^/A. 

(2.2) VM G (0, oo), VM2 G (0, 00) and V(5 > 

1 ^ 

max— =^ sup \F {y\nt,u) - F {y\nt,v)\ = Op{l) . 

y V J ||«-!,||<M2r-i/2-* 

(2.3) VM G (0,00), there exists a uniformly continuous (vector) function h{r) from [0,1]^ to R^, 
such that 

T 



sup sup 

vGBt rG[0,l]2 



-l=Y^htir,v)-hiryVT{eo-v) 

t=2 



where 



ht{r,v) = d{F{-\^t-i,eQ),F{-\at~i,v),r2)n 

+d {F {■\nt, 60) , F {■\nt, v) , n) / (F {Yt-i\nt-i. Oo) < ra) . 

Assumption 3 Linear expansion of the estimator: when the sample is generated by the null 
Ft{y\0,t,6o), the estimator 9 admits a linear expansion 



1 ^ 

t=i 



(7) 



with Ef, {I {Yt, fit) \nt) = and i ZLi ^ (Yt, ^t) i {Yt, fit)' ^4 ^. 

Dynamic probit/logit and general discrete choice models considered in Introduction can easily 
be adjusted to satisfy all these assumptions. Discrete support allows a simple analytical closed form 
of conditional distribution of continued variable by any continuous random variable on unit support 
as in ([2]). Assumption 1 in particular requires that F{0\Qt,9) and F{j\il.t,9) — F{j — l\Qt,6) for 
j = 1, . . . , J are bounded away from zero uniformly around 9q. To study parameter estimation effect 



7 



we need to assume some smoothness of the distribution with respect to the parameter in Assumption 
2 and a Hnear expansion of the estimator in Assumption 3. Note, the smoothness of the distribution 
with respect to the parameter is preserved after continuation, therefore Assumption 2 is similar to 
continuous case in Kheifets (2011); local Lipschitz continuity or existence of uniformly bounded first 
derivative of the distribution w.r.t. parameter is sufficient. For bootstrap we will need to strengthen 
Assumption 3 (see Assumption 3B below), although both conditions are standard and satisfied for 
many estimators, for example for MLE. Note, that to establish the convergence of the process V2T 
(with known ^0) under the null (the following Proposition 2), we do not need these assumptions. 

We now describe the asymptotic behavior of the process V2t('') under Hq. Denote by " =^ " 
weak convergence of stochastic processes as random elements of the Skorokhod space D ([0, 1]^). 

Proposition 2 Under Ho 

V2T =^ V200, 

where V2oo(^) is bi-parameter zero mean Gaussian process with covariance 

Covv2^{r,s) = (ri A si)(r2 A S2) + {ri A S2)^2Si + (r2 A si)riS2 — 3rir2SiS2. 

To take into account the estimation effect on the asymptotic distribution, we use a Taylor ex- 
pansion to approximate V2T{r) with V2T{r), 



V2T{r) = V2T{r) + Vf(e- eo)'h{r) + Op(l) 



uniformly in r. To identify the limit of V2T{r), we need to study limiting distribution of VT{9 — 9o), 
using the expansion from Assumption 3. Define 

CHr,s,e) = E]^^^T^^^^Y^^^^^ j ^ ^Y^U^Yu^t) ) 

and let iy2oo{f), '^'00)' be a zero mean Gaussian process with covariance function C(r, s,^o) = 
lim^^-^oo Cxir, s, Oq). Dependence on 9 on right hand side (rhs) comes through Ut and I (•, •). 

Suppose the conditional distribution function H{y\VLt) is not in the parametric family F{y\Q.t,9) 
but has the same support. For any Tq G {0, 1, 2, } and T > Tq define conditional on J7( conditional 
df 



GT{yPt,9) = ( 1 - ^ ) F{yPt,9) + ^H{yPt). 



Now we define local alternatives: 

Hit'- Conditional cdf of Yj is equal to GT{y\^t^ 9q) with Tq 7^ 0. 

Conditional cdf GT{y\^tiGo) allow us to study all three cases: Hq \i Tq = 0, Hit if T = 
To, To + IjTq + 2, ... and Tq 7^ and Hi if we fix T = Tq. In the next proposition we provide the 
asymptotic distribution of our statistics under the null and under the local alternatives. 

Proposition 3 a) Suppose Assumptions 1-3 hold. Then under Hq 

r(F2r) A r(F2oo), 

where V2oQ{r) = V2oQ{r) - h{ryipoo- 

b) Suppose Assumptions 1-3 hold. Then under Hit 



r(t>2T) 4 r (^200 + \/TQk - ^/TQ^'h) 
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where 




+ d{H i-int) , F {-IQt, Bo) , r-i) / (F (yt_i|f]t_i, ^o) < ra)} , 

and 




Under Gt, the random variables Ut = F^(Y^\Q,t, ^o) are not anymore iid, instead C/; = G^y/|J]i, 9o) 
are miiform iid. The first term in k{r) controls for the lack of uniformity of Ut (and it is similar 
to Bai's (2003) k{r)), it is zero when Ut are uniform. The second term in k{r) adds control for 
independence of Ut, cf. Kheifets (2011). 

Under the alternative we may have also that ([7]) is not centred around zero, since Egj. {i (Yt, 0,t) \^lt) = 
^^Eh {£{Yt,Q,t) \^t), therefore ^ may be nonzero, which stands for information from estimation. 
This term does not appear in Bai (2003) method, since his method projects out the estimation effect. 

For the case of the one parameter empirical process, we can provide the following corollary, which 
is similar to Bai (2003) 's single parameter results. 

Corollary 4 a) Suppose Assumptions 1-3 hold. Then under Hq 

r(vi^(.))4r(V2oo(-,i)), 

where Vioo(-) = Vioo{-) - h{-, l)''0oo and lioo(-) = V2oo{-, 1)- 
b) Suppose Assumptions 1-3 hold. Then under Hit 

r(ViT (•)) ^ r(Vioo (•) + V%ki (•) - v^/i(., i)'o, 

where for r G [0, 1] 

1 ^ 

ki{r) = phm TfY^diH {-Iflt) , F ^,00) , r) . 

Note then that tests based on Vit are not consistent against alternatives for which ki = and 
h{-, 1) = but /c / or /i(-, 1) 7^ on some set of positive measure. 

We will justify our bootstrap procedure now, i.e. we prove that r(V^'^) has the same limiting 
distribution as r(V2T)- We say that the sample is distributed under {9t : T > 1} when there 
is a triangular array of random variables {Yrt '■ T > l,t < T} with {T,t) element generated by 
F{-\il,Tt,OT), where Qxt = {Xt-i, Xt-2, ■ ■ ■ , Yxt-i, YTt-2, • • •)• Similar arguments can be applied to 
other statistics. 

Assumption 3B For all nonrandom sequences {9t ■ T > 1} for which 6t ^ Oq, we have 

1 ^ 

VT{e-eT) = -= J]^(yrt,OT«) + op(i), 

t=i 

under {dr ■ T > 1}, where E [£ (Yrt, f^r*) \^Tt] = and 

1 ^ 

t=i 

Note that the function i {■,■) now depends on Ot and is assumed to be the same as in Assumption 
3. We require that estimators of close to Oq points have the same linear representation as the 
estimator of 9q itself. 
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Table 1: Different scenarios for Monte Carlo experiments. 





DGP 


Null 


1 


probit static 


probit static 


2 


probit dynamic 


probit dynamic 


3 


probit interactions probit interactions 


4 


logit static 


probit static 


5 


chi2 static 


probit static 


6 


logit dynamic 


probit static 


7 


chi2 dynamic 


probit static 


8 


logit interactions 


probit dynamic 


9 


chi2 interactions 


probit dynamic 


10 


logit interactions 


probit static 


11 


chi2 interactions 


probit static 



Proposition 5 Suppose Assumptions 1, 2 and 3B hold. Then for any nonrandom sequence {Ox '■ 
T > 1} for which 6t — )• 9o, under {9t : T > 1}, 

r(V2T(r))4r(T/2oo(r)). 

4 Monte Carlo Simulation 

In this section we investigate the finite sample properties of our bootstrap tests using Monte Carlo 
exercise. We use a simple dynamic Probit model with one exogenous regressor with autoregressive 
dynamics. We consider three specifications of dynamics 

Static model : vr^ = ttq + (3Xt, 
Dynamic model : vr^ = ttq + + PXt, 

Dynamic model with interactions : nt = ttq + 6iYt-i + ^lYf-iXt + (3Xt, 71 = —2/3, 

where in all specifications Xt follows an AR(1) process, 

Xt = aiXt^i + et, et ~ UN (0, 1) , 

and we set vro = 0, /3 = 1, 5i = 0.8, ai = 0.8. 

We try 11 different scenarios of data generating processes (DGP) and null hypotheses (see Table 
[1]). In the first three we study the size properties of static, dynamic and dynamic with interactions 
probit models. Other scenarios check power when dynamics and/or marginals are misspecified. We 
take logit and (xf — l) /2^/^ as alternative distributions. We use sample sizes T = 100 (Table [2]), 
300 (Table [3]) and 500 (Table H]) with 1000 replications. To estimate the Bootstrap percentages 
of rejections we use a Warp bootstrap Monte Carlo (see Giacomini, Politis and White, 2007) for 
all considered test statistics. For tests based on "continued" residuals we consider one-parameter 
(p = 1) and two-parameter empirical processes {p = 2) with j = 1 and j = 2 lags and Cramer-von 
Misses (CvM) and Kolmogorov-Smirnov (KS) criterions. To make the results more readable, we 
denote them as CvMo = CvMi = L>g^^f , CvM2 = L>g^J and KSo = D^^ , KSi = D^^^, 

KS2 = D2X2- We consider Box-Pierce type tests for Gaussian and discrete residuals with m = 
1,2,25. We also check normality of Gaussian residuals with a bootstrapped Jarque-Bera test (JB). 
The results of empirical process tests with further lags j = 3, 4, 5 and correlation tests on uniform 
residuals do not provide additional information and are omitted. 
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Table 2: Percentage of rejections of test statistics with T = 100. 



CvMq CvMi CvM2 KSo KSi KS2 BPNi BPN2 BPN25 JB BPDx BPD2 BPD25 



1T0% 

5% 
1% 



3.5 
0.3 



7.4 
4.3 
0.9 



10.4 

4.3 

0.4 



8.4 10.1 9.2 
3.9 4.8 4.7 
0.5 1.1 1.7 



9.5 
4.6 
1.5 



9.6 
3.7 
0.8 



9.3 

3.8 
0.3 



8.3 10.1 

4.4 5.5 
2.1 0.8 



10.6 

5.1 

0.6 



3.4 
0.3 



10% 

5% 
1% 



7.9 
3.0 
0.0 



8.3 
3.6 
0.4 



8.7 
4.0 
0.1 



7.0 

2.8 
0.8 



9.6 
4.9 
1.2 



9.2 
4.5 
1.0 



9.0 
6.0 
0.9 



10.6 

4.0 

1.3 



7.0 

2.1 
0.3 



11.2 
5.9 
1.5 



9.5 
4.0 
0.4 



10.7 

4.4 

1.1 



12.8 

6.1 

0.7 



[10% 

5% 
1% 



8.9 
4.1 
0.1 



10.0 

4.1 

0.1 



9.5 
3.9 
0.2 



7.7 10.6 
3.6 4.9 
1.1 0.5 



9.4 
5.0 
0.8 



10.1 

5.5 
1.2 



11.3 

5.5 
1.1 



8.9 

4.5 
0.5 



10.7 

5.4 
1.1 



9.2 

5.5 
0.6 



9.2 

3.8 
0.8 



10.1 

5.4 
0.5 



110% 
5% 

1% 



8.1 
3.9 

0.5 



9.0 
4.6 

0.4 



7.6 
3.5 

0.3 



8.4 8.9 
3.6 4.1 

0.6 0.6 



9.9 
3.7 

0.6 



7.2 
3.5 
1.2 



4.1 

0.7 



7.5 
3.6 
0.6 



9.9 
3.0 
0.5 



9.0 
5.1 
1.0 



9.2 
4.6 
1.9 



9.0 
4.1 
0.7 



10% 
5% 

1% 



10.4 
4.9 
0.5 



9.5 
6.1 
0.9 



10.2 
5.6 
0.3 



12.0 10.1 
5.9 5.2 
0.8 1.2 



11.1 
5.7 
0.3 



9.2 
5.7 
1.0 



11.5 
6.3 
1.0 



10.7 

6.1 

0.7 



20.3 
12.6 

4.3 



8.0 
4.6 

1.8 



7.5 
3.7 

1.6 



9.2 
4.8 
0.9 



110% 
5% 
1% 



9.5 
4.6 

0.4 



11.0 
4.9 
0.5 



7.6 
3.5 

0.8 



9.2 9.8 9.3 
3.5 5.2 4.6 
0.5 1.4 0.9 



19.1 
10.7 

2.9 



15.4 
9.0 
2.3 



11.7 

6.6 

0.8 



11.0 
4.7 
0.9 



43.0 
29.4 
11.0 



35.3 
20.5 
5.7 



16.8 
9.4 
2.9 



10% 
5% 

1% 



10.3 
4.8 
0.1 



10.9 
5.2 
1.5 



9.4 
4.7 
0.1 



9.2 10.0 
3.9 4.7 
1.2 1.3 



9.3 
5.2 
0.5 



28.3 
20.6 

9.4 



26.4 
16.4 
6.0 



14.5 
8.5 
2.6 



13.7 
7.7 

2.3 



60.0 
47.1 

26.0 



50.6 
37.0 

16.4 



24.6 
16.7 
5.6 



110% 
5% 

1% 



9.7 
4.0 
0.8 



9.2 
3.7 

0.8 



13.7 
7.8 

2.5 



9.9 10.1 13.0 
3.6 5.5 7.8 
0.8 1.1 1.3 



14.0 
6.5 

0.6 



26.6 
18.8 
6.4 



16.2 
10.1 

2.3 



9.9 
3.8 
0.3 



46.2 
36.9 
17.1 



57.4 
45.1 
27.6 



30.1 
18.6 
5.0 



10% 
5% 

1% 



14.4 
8.9 
0.9 



16.9 
10.0 

1.8 



29.1 
21.1 

3.9 



16.0 20.6 34.4 
9.9 12.5 18.5 
1.4 4.2 3.8 



18.1 
11.2 

3.4 



55.7 
48.4 
26.9 



33.5 
23.1 

11.6 



20.0 
12.4 
3.1 



79.0 
72.9 

58.1 



82.2 
81.0 
77.2 



64.9 
59.8 

43.8 
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10% 
5% 

1% 



8.6 
2.8 
0.6 



14.7 

8.5 

1.4 



18.1 
9.5 

2.1 



7.6 15.5 13.0 
3.8 7.6 6.6 

0.4 1.3 0.5 



28.0 
17.5 

5.3 



42.0 
29.0 

12.4 



21.3 
12.9 

4.4 



9.2 
3.9 

0.3 



50.1 
35.7 

22.4 



79.8 
69.8 

45.6 



43.6 
30.4 
10.7 



11 



10% 
5% 
1% 



9.0 
3.4 
0.2 



28.1 
17.6 
6.1 



33.6 
19.8 
4.2 



8.1 
3.3 
0.2 



29.2 28.4 
17.1 11.8 
3.4 1.4 



53.1 
40.7 
23.2 



85.1 
76.3 
57.1 



60.3 
43.1 
22.0 



5.3 
0.6 



72.0 
61.6 
40.8 



99.9 
99.7 
98.4 



94.2 
90.5 
72.3 
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Table 3: Percentage of rejections of test statistics with T = 300. 







CvMo CvMi CvM2 KSo KSi KS2 BPNi 


BPN2 




JB 


BPDi 


BPD2 


BP Dor. 




10% 


8.3 


9.2 


9.2 


9.1 


9.0 


10.4 


8.0 


8.3 


8.6 


8.0 


8.3 


8.2 


10.2 


1 


5% 


4.1 


4.7 


4.6 


5.2 


4.8 


4.6 


3.6 


3.9 


4.4 


2.7 


4.2 


3.1 


4.8 




1% 


0.7 


1.0 


0.6 


0.8 


0.6 


0.4 


0.6 


0.7 


1.1 


0.6 


0.6 


0.6 


1.0 




10% 


9.3 


8.9 


10.2 


9.3 


9.8 


11.6 


7.6 


10.0 


9.7 


9.6 


9.0 


6.9 


7.4 


2 


5% 


5.5 


4.4 


5.9 


5.2 


4.9 


6.4 


3.8 


4.6 


4.8 


3.8 


3.7 


3.6 


3.2 




1% 


1.0 


1.1 


0.9 


1.4 


1.1 


0.9 


0.5 


0.7 


1.4 


0.7 


0.5 


0.2 


0.6 




10% 


8.5 


12.3 


8.7 


8.7 


12.2 


9.9 


8.1 


9.9 


10.1 


9.0 


10.1 


9.4 


13.5 


3 


5% 


4.5 


5.1 


5.1 


3.3 


5.4 


5.3 


4.4 


5.2 


5.9 


4.2 


5.3 


4.2 


5.1 




1% 


1.1 


1.0 


1.5 


1.2 


0.6 


0.9 


0.9 


1.0 


1.3 


0.5 


0.6 


1.2 


1.5 




iU/O 


9.9 


iU.Z 


\).Z 


9.5 


1 A 
10.0 


A 
9.0 


O.D 


y .0 


1 n n 

iU.U 


1 1 9 
±±.z 


0.0 


Q R 

y .0 


0.0 


4 


o/o 


/I Q 

4.0 


4.y 


A a 
4.0 


K Q 
0.0 


b.Z 


A n 

4.y 


A A 

4.4 


0.1 


A n 

4.U 


D.i 


Q Q 
0.0 


O.Z 


O.D 




i /o 


0.9 


1.1 


0.6 


0.9 


1.3 


1.1 


0.5 


u.o 


1 n 


n 7 
u. / 


U.o 


U.o 


n 
u.y 




10% 


16.5 


15.1 


14.9 


15.8 


14.9 


14.4 


11.8 


11.2 


8.7 


41.4 


6.0 


7.2 


9.0 


5 


5% 


8.8 


7.9 


8.3 


9.6 


7.5 


8.5 


5.0 


6.1 


4.1 


30.4 


4.6 


6.3 


6.7 




1% 


1.6 


2.0 


1.8 


1.8 


2.2 


1.4 


0.9 


1.4 


1.0 


9.6 


3.0 


3.1 


2.9 




10% 


8.8 


15.4 


11.3 


9.0 


13.7 


10.1 


38.3 


29.9 


16.0 


9.6 


79.1 


69.0 


29.2 


6 


5% 


4.7 


9.8 


4.8 


5.4 


7.9 


6.1 


25.1 


21.8 


8.5 


5.8 


65.2 


58.1 


19.3 




1% 


0.5 


2.0 


0.3 


0.6 


1.5 


1.5 


11.8 


5.8 


1.5 


0.7 


43.7 


36.6 


6.8 




10% 


11.8 


12.2 


14.4 


12.3 


8.8 


13.2 


42.5 


35.1 


15.6 


24.5 


55.6 


47.1 


24.0 


7 


5% 


7.0 


5.9 


9.4 


6.2 


4.1 


6.3 


31.2 


25.0 


9.9 


16.3 


44.6 


39.5 


15.0 




1% 


1.1 


1.3 


1.9 


1.2 


1.2 


2.0 


15.3 


9.2 


2.6 


3.3 


35.3 


20.2 


5.8 




10% 


12.5 


18.3 


44.9 


13.0 


17.2 


44.0 


19.5 


67.0 


30.0 


12.6 


91.4 


96.5 


72.0 


8 


5% 


6.3 


12.5 


31.6 


6.4 


10.3 


28.9 


10.6 


55.7 


19.3 


6.2 


85.6 


94.0 


59.5 




1% 


1.0 


2.6 


9.8 


0.9 


2.7 


8.5 


2.7 


31.3 


7.4 


1.4 


71.4 


87.7 


37.3 




10% 


32.9 


42.5 


81.0 


29.5 


46.9 


88.8 


34.3 


92.0 


71.3 


24.8 


99.0 


99.7 


98.8 


9 


5% 


17.3 


25.9 


65.2 


19.7 


36.3 


80.2 


25.2 


88.8 


64.7 


17.6 


98.0 


99.6 


97.8 




1% 


3.0 


7.8 


36.3 


3.9 


14.3 


56.4 


12.3 


77.1 


42.1 


4.0 


94.5 


99.1 


95.6 




10% 


8.7 


33.1 


44.9 


9.7 


28.3 


33.7 


51.9 


83.2 


49.4 


9.3 


81.7 


99.6 


84.6 


10 


5% 


4.4 


22.4 


30.0 


4.7 


17.7 


21.4 


36.8 


74.5 


35.2 


5.5 


69.5 


98.8 


78.7 




1% 


1.1 


9.0 


11.1 


0.8 


6.8 


5.2 


18.2 


54.0 


17.0 


1.2 


37.9 


97.1 


48.1 




10% 


8.6 


46.2 


76.7 


9.1 


46.7 


76.9 


63.8 


99.5 


89.7 


U.O 


81.5 


100.0 


100.0 


11 


5% 


4.3 


32.8 


63.9 


4.3 


36.1 


67.8 


51.0 


98.8 


81.8 


5.6 


68.2 


100.0 


100.0 




1% 


0.4 


11.7 


37.3 


0.4 


18.4 


42.8 


31.7 


94.9 


63.9 


0.9 


39.4 


100.0 


99.3 
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Table 4: Percentage of rejections of test statistics with T = 500. 



CvMq CvMi CvM2 KSo KSi KS2 BPNi BPN2 BPN25 JB BPDx BPD2 BPD25 



1T0% 

5% 
1% 



10.2 
4.7 
0.5 



8.1 
4.5 
1.0 



9.2 
4.9 
0.7 



9.8 
4.2 
0.7 



8.6 
3.9 
0.6 



8.0 
4.3 
1.0 



11.9 
6.0 
0.6 



11.7 

5.4 
0.8 



11.1 

5.6 
0.8 



11.3 10.1 
5.3 5.2 
1.2 1.1 



8.6 
4.7 
0.4 



9.8 
4.7 
1.1 



10% 

5% 
1% 



9.1 
4.3 
0.7 



8.0 
4.9 
0.8 



8.3 
4.4 
0.8 



10.2 
4.7 
0.5 



8.6 
3.8 
0.2 



10.7 

4.1 
0.6 



11.6 
5.6 
0.6 



11.5 

4.8 
1.4 



8.4 
3.9 
0.5 



9.2 8.6 
4.6 4.3 
0.5 1.4 



9.9 
5.7 
0.8 



10.6 
5.6 
0.8 



[10% 
5% 
1% 



9.1 
4.1 
0.7 



8.9 
3.9 
1.1 



9.9 

3.1 
1.0 



9.2 8.6 8.5 

4.8 4.5 4.9 
1.0 0.4 0.4 



10.0 
5.7 
1.1 



11.6 

7.1 
1.0 



10.7 

5.4 
1.9 



11.0 10.7 

5.2 4.6 
1.5 1.5 



10.2 

5.3 
2.0 



10.0 

4.8 
1.6 



110% 
5% 

1% 



10.7 
5.2 

0.4 



8.9 
4.3 

0.8 



10.9 
4.8 

1.0 



10.7 10.8 
5.2 3.9 
0.7 0.6 



4.1 

0.4 



11.5 
6.1 
0.7 



9.2 
3.5 
1.1 



10.2 
5.2 
0.7 



7.8 10.0 
4.4 5.1 
0.7 1.4 



10.4 
6.7 

1.3 



12.3 
7.2 

1.9 



10% 
5% 

1% 



17.1 
11.4 

3.6 



16.8 
10.0 
3.7 



17.0 
9.7 
3.2 



20.0 19.4 17.8 
13.4 12.0 12.1 

4.5 4.9 3.3 



11.8 
4.4 
1.3 



12.1 
5.5 
1.7 



8.5 
3.8 
0.7 



53.2 7.9 
45.1 5.6 

23.6 3.4 



8.0 
5.4 
3.7 



14.0 
9.1 

5.3 



110% 
5% 
1% 



8.7 
5.3 
0.6 



17.1 
8.9 
1.5 



9.8 
4.9 
1.0 



10.2 15.5 8.9 
4.9 6.9 3.4 

0.9 1.1 0.8 



46.1 
32.9 

17.2 



36.2 
27.3 

8.9 



17.6 
11.8 

2.4 



9.3 
3.7 

0.6 



88.7 
82.9 

53.0 



81.2 
69.0 

46.8 



42.0 
29.5 

8.5 



10% 
5% 

1% 



15.2 
8.1 

2.2 



11.9 
5.6 
1.3 



14.8 
10.8 
2.8 



13.3 11.3 15.9 
7.5 4.8 8.0 
2.5 1.7 3.0 



39.9 
28.5 
12.0 



36.3 
28.0 
9.4 



18.0 
10.6 
3.0 



38.0 53.7 
27.9 41.0 

8.2 18.2 



42.6 
34.4 
19.5 



21.5 
16.5 

9.4 



110% 
5% 

1% 



22.6 
12.6 

3.4 



34.0 
23.3 
7.6 



90.1 
83.8 
53.9 



25.1 35.9 92.9 
14.7 25.9 86.5 

4.0 10.4 65.0 



23.7 
16.7 

5.1 



97.6 
95.9 
87.6 



76.7 
65.1 

44.7 



10.0 99.9 

5.0 99.7 

1.1 99.2 



100.0 
100.0 

99.9 



99.6 
99.2 

97.1 



10% 
5% 

1% 



56.1 
39.9 

13.4 



73.2 
59.9 
38.7 



98.8 
97.0 

88.5 



58.3 78.1 99.6 
41.9 69.9 98.8 
16.1 48.2 95.6 



62.6 
51.2 
31.1 



99.5 
99.3 

98.6 



96.9 
95.4 

91.4 



30.4 100.0 

20.5 100.0 
10.7 100.0 



100.0 
100.0 

100.0 



100.0 
100.0 

99.8 
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10% 
5% 

1% 



9.9 
5.2 
1.1 



58.4 
43.1 

14.3 



90.5 
82.2 

59.4 



10.3 52.8 88.7 
5.0 38.2 79.8 

0.6 14.1 39.6 



74.8 
65.2 

44.5 



99.6 
99.3 

97.3 



93.6 
89.4 
78.0 



7.9 98.7 
4.5 95.9 

1.3 86.0 



100.0 
100.0 

100.0 



100.0 
100.0 

99.7 



11 



10% 
5% 
1% 



10.0 
3.6 
0.5 



74.7 
62.5 
38.5 



99.1 
96.7 
87.5 



11.3 81.3 98.0 
4.8 71.3 95.6 
0.8 33.1 80.7 



86.8 
78.9 
64.0 



100.0 
100.0 
100.0 



100.0 
99.9 
99.9 



9.4 99.6 
4.6 98.6 
1.3 87.1 



100.0 
100.0 
100.0 



100.0 
100.0 
100.0 
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Now we discuss the performance of empirical process based tests in comparison with traditional 
correlation tests. For T = 100 almost all test statistics are slightly undersized (cases 1-3). The 
situation improves with larger T, and CvM statistics approach faster to nominal rates than KS. 
Overall, empirical size at T = 500 is very good. The situation with power is not unambiguous. 
No test can capture static logit alternative to the null hypothesis of static probit model even at 
T = 500 (cased]). On the other hand, when static alternative to the null hypothesis of static 
probit is considered (case[5|), there is some power at T = 300 which improves with T = 500 for all 
empirical process based tests. Since under the null and under the alternative we have static models, 
correlation tests do not have power. Normality test (JB) is doing well only in the latter case. When 
there is a slight dynamic misspecification added to logit (case[6|), CvMi and KSi improve, but 
when it is added to our tests and JB doing worse (casefTj). Correlation tests, on the contrary 
display power against these dynamic alternatives. When the alternative has dynamic interactions, 
and the null is a dynamic probit (cases [8] and [9|) , all tests (but JB for logit) are doing well, and even 
better if higher lags are taken into account. Finally, when dynamic interactions are taken versus 
static model (cases fTOl and [TT]) . power is very good, and increases when more lags are considered. 
Exceptions are "marginal tests" CvMq, KSq and JB. To summarize, dynamic misspecification can 
be captured well by empirical process statistics and correlation tests. Misspecification in marginals, 
on the contrary, can not be distinguished at all by correlation tests but empirical process statistics, 
possibly multi-parameter, still work, although further research in improving power of these tests is 
needed. 

To develop our omnibus type tests we introduce additional continuous noise. An important 
question is the effect of this noise on the power of the tests. Since correlation tests based on discrete 
residuals BPDj do not use additional noise, while correlation tests based on continuous residuals 
BPNj do, we can use the difference in rejection rates between these sets of statistics under dynamic 
misspecification as an indirect measure of the effect of the introduced noise, though correlation tests 
are not consistent against static alternatives. From our Monte Carlo simulations we see that for all 
scenarios we consider, correlation tests based on discrete residuals perform better, indicating that 
some power losses may indeed be attributed to the introduced noise. To overcome this problem, we 
plan to develop tests for discrete models based on alternative transformations of the data without 
introducing additional noise, but still consistent against a wide range of nonparametric alternative 
hypotheses. 

5 Conclusion 

In this paper we have proposed new tests for checking goodness-of-fit of conditional distributions 
in nonlinear discrete time series models. Specification of the conditional distribution (but not only 
conditional moments) is important in many macroeconomics and financial applications. Due to the 
parameter estimation effect, the asymptotic distribution depends on the model and specific param- 
eter values. We show that our parametric bootstrap provides a good approximation to asymptotic 
distributions and renders feasible and simple tests. Monte Carlo experiments have shown that tests 
based on empirical processes have power if misspecification comes from dynamics. If misspecification 
affects marginals alone, correlation tests are inconsistent, while tests based on empirical processes 
have some power. Comparing to the continuous case, we may conclude that there is a reduction of 
power due to the additional noise which distribution is known under the alternative too. 
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Appendix 



Proof of Proposition [T]. Part (a) is a property of dynamic PIT with a continuous conditional 
distribution , the proof can be found in Bai (2003). Part (b) follows from the fact that (omitting 
dependence on t, Qt and 9) 

Ft(y + Z-1) = F{[Y + Z-l]) + Z^P{[Y + Z]) 
= F{Y -1) + Z^F(Y), 

where 

Z^ = F,{Y + Z - I - [Y + Z - I]) = F, (Z) 

is uniform for any Z ^ F^ continuous and with [0, 1] support, by the usual static PIT property. 
Therefore, although a continued variable Y'^ and its distribution depends on F^, F'^{Y'^) does 
not. □ 
Proof of Propositions 2. Assumption 1 in Kheifets (2011) is satisfied automatically after 
applying continuation defined in (l2|), therefore Proposition 1 of Kheifets (2011) holds. □ 
Proof of Propositions 3. Follows from Kheifets (2011), we need only to check that Assumption 
2 in Kheifets (2011) is satisfied. 

Let r = F'^ [y] . Note that [y] = F~^{r) but F {[y]) = F (F^^(r)) equals r only when y = [y]. 
The inverse of i^^ is 

) ^ ) ^2^J+p([y] + l) ^yJ+ + P(b] + 1) 



F~Hr) + 



r-F(F-i(r)) 
P(F-i(r) + l) ■ 



Note also that (r — F ([y])) / P ([y] + 1) = y — [y] G [0, 1]. Take distribution G with the same support 
as F. We have different useful ways to write d {G, F, r): 



d{G,F,r) = r/t(r)-r = Gt (^(^t) \r)^ - r = G^ (y) - r 

= G{[y])-F{[y]) + {y-[y]){FG{[y] + l)-FF{[y] + l)) (9) 
= G([y] + 1)-F([y] + 1) 

+ (y - [y] - 1) (Pg {[y] + 1) - Pf {[y] + 1)) (10) 
= G {F^^ (r)) - F {F-\r)) 

^¥70WTT) + ^) - + • ^''^ 

Thus, noting that P (•) is bounded away from zero, we have that Assumption 2 in this paper is 
sufficient for Assumption 2 in Kheifets (2011): 

(K2.1) 

E sup sup sup 

t=l,..,TueBT re[0,l] 

(K2.2) VM G (0,oo), VM2 G (0,oo) and V(5 > 



4ir,u,0o)-r\=o(T-'/^^ 



sup ^ V sup 4 (r, u, Oo) - Vt (r, V, ^0) = Op (1) • 

re[0,l] VI ^^-^ ||«-,;||<M2T-i/2-* 
u,vSBj' 



15 



(K2.3) VM G (0,oo), VM2 G (0,oo) and \/5 > 



1 

sup (^M,6'o) - 7?J (s,M,6'o) 

|r-s|<M2r-i/2-« V J ^^-^ u£Bt 



(K2.4) VM E (0,00), there exists a uniformly continuous (vector) function h{r) from [0,1]^ to R^, 
such that 



sup sup 

u&Bt re[0,l]2 



t=2 



Op(l). 



where 



ht = (r?ti {r2,u, 60) - ra) n + {ri,u, Oq) - / (y^liln) < ra) . 



For Part a), take d (^F {■\nt,Oo) , F (^-{ntj^y Then (K2.1), (K2.2), (K2.4) follow from (2.1), (2.2) 

and (2.3) because of representation ([IT]) . If we compare ^ and (fTUj) we see that (i(-) is not only- 
continuous in r, but piece-wise linear, so (K2.3) is satisfied automatically. 

For Part b), take d ^Gt {-10,1, 60) ,F ^-Ifi^,^^^ and use the additivity of d(-) in the first argu- 
nts: 



ments 



d(^GTmt,eo),F(^-\nt,e)) = (^i-^^d(^F{-\nt,0o),F (^-{Qt 

+^d(^H{-\nt),F(^-\nt,i 



Proof of Propositions 5. The proof is similar if we consider 
{Ot-T> 1}. 



□ 

d(^F{-\nt,eT),F(^-\nt,eT)) under 

□ 
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